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\ G. SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT; Darrell Anderson, Nabil Hanna. John Leonard, 

Roland Newman and Mitchell Reff and William H. 
Rastetter 



(ii) TITLES OF INVENTION: THERAPEUTIC APPLICATION OF 

CHIMERIC AND RADIOLABELED 
ANTIBODIES TO HUMAN B 
LYMPHOCYTE RESTRICTED 

15 \ DIFFERENTIATION ANTIGEN FOR 

TREATMENT OF B CELL LYMPHOMA 

(iii) NUMBER OF SEQUENCES: 8 
20 (iv) CORRESPONDING ADDRESS: 



25 



(A) 


ADDRESSEE: 


IDEC Pharmaceuticals Corporation 


(B) 


STREET: 


11011 Torreyana Road 


(C) 


CITY:\ 


San Diego 


(D) 


STATE: 


California 


(E) 


COUNTRY: 


USA 


(F) 


ZIP: \ 


92121 



(v) COMPUTER READABLE FORM: 

(A) MEDIUMATYPE: Diskette, 3.5 inch, 1.44 Mb 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: MS.DOS 

(D) SOFTWARE: Microsoft Word 5.0 

(vi CURRENT APPLICATION DATA: 



(A) APPLICATION NUMBER: 

(B) FILING DAT! 
40 (C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. Jr. 

45 (B) REGISTRATION NUMBER: 34,787 

(C) REFERENCE/DflCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

50 (A) TELEPHONE: (619)550-8500 

(B) TELEFAX: (619)560-8750 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) I LENGTH: 8540 bases 

(B) I TYPE: nuolaic aoid 

(C) 1 STRANDEDNJSSS: single 

(D) I TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPpTHETICAL: 

A 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GACGTCGCGG 


CCGCTCTAGG 


CCTCCAAAAA 


AGCCTCCTCA 


CTACTTCTGG 


AATAGCTCAG 


60 


AGGCCGAGGC 


GGCCTCGGCC 


TCTGCATAAA 


TAAAAAAAAT 


TAGTCAGCCA 


TGCATGGGGC 


120 


GGAGAATGGG 


CGGAACTGGG 


CGGAGTTAGG 


GGCGGGATGG 


GCGGAGTTAG 


GGGCGGGACT 


180 


ATGGTTGCTG 


ACTAATTGAG 


ATGCATGCTT 


TGCATACTTC 


TGCCTGCTGG 


GGAGCCTGGG 


240 


GACTTTCCAC 


ACCTGQTTGC 


TGACTAATTG 


AGATGCATGC 


TTTGCATACT 


TCTGCCTGCT 


300 


GGGGAGCCTG 


GGGACTTTCC 


ACACCCTAAC 


TGACACACAT 


TCCACAGAAT 


TAATTCCCCT 


360 


AGTTATTAAT 


AGTAATuAAT 


TACGGGGTCA 


TTAGTTCATA 


GCCCATATAT 


GGAGTTCCGC 


420 


GTTACATAAC 


ttacggiUaa 


TGGCCCGCCT 


GGCTGACCGC 


CCAACGACCC 


CCGCCCATTG 


480 


ACGTCAATAA 


TGACGTATGT 


TCCCATAGTA 


ACGCCAATAG 


GGACTTTCCA 


TTGACGTCAA 


540 


TGGGTGGACT 


atttacgqta 


AACTGCCCAC 


TTGGC AGT AC 


ATCAAGTGTA 


TCATATGCCA 


600 


AGTACGCCCC 


CTATTGACpT 


CAATGACGGT 


AAATGGCCCG 


CCTGGCATTA 


TGCCCAGTAC 


660 


ATGACCTTAT 


GGGACTTTCC 


TACTTGGCAG 


TACATCTACG 


TATTAGTCAT 


CGCTATTACC 


720 


ATGGTGATGC 


ggttttggAa 


GTACATCAAT 


GGGCGTGGAT 


AGCGGTTTGA 


CTCACGGGGA 


780 


TTTCCAAGTC 


TCCACCCCAJT 


TGACGTCAAT 


GGGAGTTTGT 


TTTGGCACCA 


AAATCAACGG 


840 


GACTTTCCAA 


AATGTCGTAA 


CAACTCCGCC 


CCATTGACGC 


AAATGGGCGG 


TAGGCGTGTA 


900 


CGGTGGGAGG 


TCTATATAAQ 


CAGAGCTGGG 


TACGTGAACC 


GTCAGATCGC 


CTGGAGACGC 


960 


CATCACAGAT 


CTCTCACCATl 


GAGGGTCCCC 


GCTCAGCTCC 


TGGGGCTCCT 


GCTGCTCTGG 


1020 


CTCCCAGGTG 


CACGATGTGA 1 


TGGTACCAAG 


GTGGAAATCA 


AACGTACGGT 


GGCTGCACCA 


1080 


TCTGTCTTCA 


TCTTCCCGCC 


lATCTGATGAG 


CAGTTGAAAT 


CTGGAACTGC 


CTCTGTTGTG 


1140 


TGCCTGCTGA 


ATAACTTCTA 


TCC C AG AG AG 


GCCAAAGTAC 


AG TGG AAGGT 


GGATAACGCC 


1200 


CTCCAATCGG 


GTAACTCCCA 


GGAGAGTGTC 


ACAGAGCAGG 


ACAGCAAGGA 


CAGCACCTAC 


1260 
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CCATTGACGT CAATG< 



AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AG AAAC AC AA AGTCTACGCC 1320 

TGCGAAGTCA CCCAJTCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 13 80 

TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT^ GACAACATGC 1440 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1560 

TAATAAAATG AGGAJ JVTTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1620 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1680 

GCGGTGGGCT CTATC JGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 17 40 

ACGTCAATGA CGGTi AATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 1800 

TTCCTACTTG GCAGlACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 1860 

GGCAGTACAT CAATqSGCGT GG ATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 1920 

3GAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 1980 

GTAACAACTC CGCCCCtATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2 040 

TAAGCAGAGC TGGGT£*CGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2100 

ATGGGTTGGA GCCTCAITCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC 2160 

AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 2220 

GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 22 80 

GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 23 40 

TCCCTCAGCA GCGTGGTpAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 2 4 00 

AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGCAGAGCC C AAATC TTGT 24 60 

GACAAAACTC ACACATG (fCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 2520 

TTCCTCTTCC CCCCAAA^lCC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 2580 

TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 2 640 

GGCGTGGAGG TGCATAAtAc CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 27 00 

CGTGTGGTCA GCGTCCTCPJC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 27 60 

TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 2820 

GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 28 80 

AACCAGGTCA GCCTGACCTG\ CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 29 40 

TGGGAGAGCA ATGGGCAGCC VGGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 3000 

GACGGCTCCT TCTTCCTCTA ^AGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 30 60 

AACGTCTTCT CATGCTCCGT C^VTGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 3120 

CTCTCCCTGT CTCCGGGTAA AT^GAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT 3180 
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TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT 3240 

GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3300 

CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3 3 60 

crfA^Wfcioici cid^tsaeiKifra **aapf\t*iXAt*A ttfi9AAete*at*E*A tt&A?"fe*t*ci*A csaqaavau^A 3 4bo 

10 GGCATGCTGG GGATGCGGTG GGCTCTATGG AAC C AGCTGG GGCTCGACAG CGCTGGATCT 3480 

CCCGATCCCC AGOTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3 540 

AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3600 

15 I 

AGACAGTGTT C TCflGC AC AG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3 660 

GACTCCTAAG CCAGITGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3720 

20 GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 37 80 

GCAGGAGCCA GGGCSJpAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3 840 

CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3 900 

25 ] 

CAAACTTGAC GGCAATICCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3 960 

GTTCGACCAT TGAACTCCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 402 0 

30 GACCTACCCT GGCCTCCIGCT CAGGAACGAG. TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 

TCTTCAGTGG AAGGTAzAaCA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 4140 

CCTGAGAAGA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 4200 

35 \ 

GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 4 260 

GAACAACCGG AATTGGCaKg TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 4 3 20 

40 TACCAGGAAG CCATGAATOA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 4380 

GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 

GAATACCCAG GCGTCCTCTq TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4500 

45 \ 

GTCTACGAGA AGAAAGACTA IACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 

CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4 620 

50 TTCTAGTTGC CAGCCATCTG TaTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4 6 80 

TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 47 40 

GTGTCATTCT ATTCTGGGGG GTCGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 

55 \ 

CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 48 60 

TAGCTTTGCT TCTCAATTTC TTArTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 49 20 

60 ACCAATTCAG TAG TTG ATTG AGcAkATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980 

TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 
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GCCAGTGAGT 


GGCACAGCAT 


TCTAGGGAGA 


AATATGCTTG 


TCATCACCGA 


AGCCTGATTC 


5100 


CGTAGAGCCA 


CACCTTGOTA 


AGGGCCAATC 


TGCTCACACA 


GGATAGAGAG 


GGCAGGAGCC 


5160 


AGGGCAGAGG 


ATATAAGGTG 


AGGTAGGATC 


AGTTGCTCCT 


CACATTTGCT 


TC TG AC AT AG 


5220 


TTGTGTTGGG 


agcttggatIc 


GATCCTCTAT 


GGTTGAACAA 


GATGGATTGC 


ACGCAGGTTC 


5280 


TCCGGCCGCT 




vJV#l— X ** X X 


v- X X vj/\t- X VJkJ 


f"£r* A P* A A A *4 A 

vav.ALAAt.AQA 


C AATC QCK C TQ 


33 40 


CTCTGATGCC 


gccgtgttcc 


GGCTGTCAGC 

VJV3\» X VJ X V_^VVJ\v 


GCAGGGGCGC 


ppggttptt't 1 

V.V.UVI X Xv*X 1 X 


1 I vj i UAAuAL 


C A A f\ 

d40U 


CGACCTGTCC 


ggtgccctga 


ATGAACTGCA 


GGACGAGGCA 


GCGCGGC TAT 

UwwVawww x n x 


V.VJ 1 VjVJV_ X vJVJV— 


jflOU 


CACGACGGGC 


gttccttgcq 


CAGCTGTGCT 


CGACGTTGTP 




par nn.n acttz 

VJrtrtVJVjVJi^V,. 1 Vj 


^ *5 n 


GCTGCTATTG 


ggcgaagtgc 


1 CGGGGCAGGA 


TCTCCTGTPA 

X V» X V.V. X VJ X f\ 


TPTPAPPTTP, 

X V. IV.AV.V. X XVj 


V- 1 v-L 1 vjV_V_ Vj/\ 


CCQA 

33oU 


GAAAGTATCC 


atcatggctg 


lATGCAATGCG 


GCGGCTGCAT 


ACGPTTGATP 

rtV- VJ\_ X X VJrt X V- 


PGGPTAPPTf^ 
V— vjvjv. X r\V_ v- X vj 


^ /I Pi 


CCCATTCGAC 


caccaagcga 


Kacatcgcat 


CGAGCGAGCA 


CGTACTCGGA 

v-VJ X X V— VJVJ /A 


TPP,AAP,PPP,P, 

X OVj/UAVjV_ v. VjVj 


C7nn 


TCTTGTCGAT 


caggatgatc 


TGGACGAAGA 


GCATCAGGGG 


v— X v.vjV,0\.v.A\J 


PPP1 APTHTT 
v.v,onnv. X vj X X 


J / ou 


CGCCAGGCTC 


aaggcgcgca 


TjGCCCGACGG 


CGAGGATCTC 


GTCGTGACPP 

vj x v» vj x wrvvw v^ 


ATGGPGATGP 
n x vjvjv^vjrA X vjv 


^ r 9 n 

J o ^ u 


CTGCTTGCCG 


aatatcatgg 


TGGAAAATGG 


PPGPTTTTPT 

\-> w\Jw X X X X \_ X 


GG A TTP A TPG 

VjVjrt X i v — c\ L \ — 


rt.V_ X Vj X vjvjV_V_vj 


coon 

JooU 


GCTGGGTGTG 


gcggaccgct 


ATCAGGACAT 


AGCGTTGGCT 

X X VJVJV-> X 


AC CCGTG AT A 

n\, v» v» vj X VJn X A 


TTGPTG A AGA 
X X vjv_ X vj^-vr\vji~i 


j y *± u 


GCTTGGCGGC 


gaatgggctg 


AOCGCTTCCT 


CGTGCTTTAC 


GGTATGGPGG 


PTPPPGATTP 


O VJ VJ vj 


GCAGCGCATC 


GCCTTCTATC 


GCCTTCTTGA 


CGAGTTPTTP 

\~\J.r\\J X X V_ X X v_ 


TttAP,PP,PP,AP 


TPTnnrinTTp 

X V_ 1 VJVJVJVJ 1 IV. 


n n 


GAAATGACCG 


ACCAAGCGAC 


GCuC AAC C TG 


PPATPAPGAG 


ATTTPGATTP 
e\ x X X v- X X V- 


v_>Vv_ V_ VJV_ V- VJ V- V- 


£ 1 9 Pi 


TTCTATGAAA 


GGTTGGGCTT 


CGG AATCG TT 


TTPPGGGAPG 

X X V_V — vJVJUrtv_VJ 


ppr,npTP,n a t 

V-V_VjVjV_ X VJVJrt. X 


P,ATPPTPPAP, 


1 ft P) 


CGCGGGGATC 


TCATGCTGGA 


GTTQTTCGCC 

VJ X X V4i X ^VJww 


rArrrrAAPT 


TriTTT a r v r vrzr % 

xox x ini x vjv_ 


APPTTAT A AT 
M.vjv_ 1 I.AI r\f\ X 


6 9 4 Pi 
a ^ f± U 


GGTTACAAAT 


AAA G fAA TAfi 


PATpkcA A AT 


TTPAPA A ATA 


A APPfi rnrprnrrirn 
AAvjCAJ. 1111 


i iv_AV_ lvjv_Al 


£ 1 Pt H 
O jUU 


IV — 1. X X Vj X Vj 


U 1 X IVj i V. V^rtJA 


.rt.l_ 1 Lril V_Arl X 


LlAiLl lAlL 


A TV* TV"! 1 /■**/""• A T> 


Cvjv_vjOv.CvjC.vj 


oJbU 


atppphtpp: a 


unuv x X vjVjv_vj 


I AA XLAl Vjvj I 


LAlAbL 1 vj I x 


ICLIvj rCjTCjA 


AATTGTTATC 




PP5PTP araiT 


X LLALALAAL 


t\ X ACvjiyjCCvj 


/*> A RPPATIA A A 

vj AAG C AT AAA 


GTGT AAAGC C 


TGGGG TGC C T 


6480 


A ATGAGTHAn 

n/A X X vj/Wj 


pt a a ptp ara 

V- 1 rT-r\\_ XV_AV_r\ 


1 1AA1 iuLfjl 


1 vjv_VjV, I LAL X 


Cjv_Lcvjv, i nc 


LAGTCGGGAA 


o b4 U 


ACCTGTCGTG 


C C AGP TGC AT 


X ftrt X VJAn X V. VJ 


nrpAArr,rr*p 

OLwAAUuLuw 


ClCXCXCX A Af^ fZP 


VjVj X X ioLol A 


DDUU 


TTGGGCGCTC 


TTCCGCTTCC 


TCGCTCACTG 

1 w\J\ri X Vw / x.H^ 1 VJ 


X V„.VjV_ XVjV»Vj 


PTPGPTPGTT 

v_ X v— OVj X V— vj X X 


PPPPTPPPPP 

V— VjVjV_ x VJV_VJVjV* 


6 660 


GAGCGGTATC 


AGCTCACTCA 


AAGGCGGlW 


TACGGTTATC 


CACAGAATCA 


GGGGATAACG 


6720 


CAGGAAAGAA 


CATGTGAGCA 


AAAGGCCAGC 


AAAAGGCCAG 


GAACCGTAAA 


AAGGCCGCGT 


6780 


TGCTGGCGTT 


TTTCCATAGG 


CTCCGCCCCfc 


CTGACGAGCA 


TCACAAAAAT 


CGACGCTCAA 


6840 


GTCAGAGGTG 


GCGAAACCCG 


acaggactat\ 


AAAGATACCA 


GGCGTTTCCC 


CCTGGAAGCT 


6900 


CCCTCGTGCG 


CTCTCCTGTT 


CCGACCCTGC 


CGCTTACCGG 


ATACCTGTCC 


GCCTTTCTCC 


6960 
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CTTCGGGAAG CGTGcbcCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 
TCGTTCGCTC CAAGC X3GGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 
TATCCGGTAA CTATCC ITCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 

CAOCCACTOO TAACAC QATT AQCAQAQCOA QQTATQTAOg CaOTOQTACA GA aTT C T T a A 

AGTGGTGGCC TAACTi CGGC T AC AC T AG AA GGACAGTATT TGGTATCTGC GCTCTGCTGA 
AGCCAGTTAC CTTCGC 1AAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 
GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 
AAGATCCTTT GATCTT TTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 
GGATTTTGGT CATGA0 ATT A TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 
GAAGTTTTAA ATCAATjCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 
TAATCAGTGA GGCACCbATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 
tccccgtcgt gtagatAact ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 
TGATACCGCG AGACCcJcGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 
GAAGGGCCGA GCGCAGAkGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 
GTTGCCGGGA AGCTAGA*TA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGC C A 
TTGCTACAGG CATCGTGdTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 
CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 



TCGGTCCTCC GATCGTTG r 'C 
CAGCACTGCA TAATTCTCT V 
AGTACTC AAC CAAGTCATT : 
CGTCAATACG GGATAATACC 
AACGTTCTTC GGGGCGAAA^ 
AACCCACTCG TGCACCCAAC 



AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 
ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 
TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 
GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 
CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 
TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 



GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 
GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 
TGAGCGGATA CATATTTGAA |TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 
TTCCCCGAAA AGTGCCACCT 



7020 
7080 
7140 
7300 
7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 

8540 



(3) INFORMATION FO^ SEQ ID NO: 2: 
60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH\ 9209 bases 
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(ii) 
(iii) 
(iv) 
(ix) 



M( 



ant! 



TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: circular 

.ECULETYPE: DNA (genomic) 
(OTHETICAL: 

A 

SENSE: no 



SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



GACGTCGCGG CCGCTtTAGG 
AGGCCGAGGC GGCCTCGGCC 
GGAGAATGGG CGGAAC'AgGG 
ATGGTTGCTG ACTAATTffiAG 
GACTTTCCAC ACCTGGTT£C 
GGGGAGCCTG GGGACTTT 
AGTTATTAAT AGTAATCAA 
GTTACATAAC TTACGGTAAA 1 
ACGTCAATAA TGACGTATGT 
TGGGTGGACT ATTTACGGTA 
AGTACGCCCC CTATTGACGT 
ATGACCTTAT GGGACTTTCC 
ATGGTGATGC GG TTTTGGC A 
TTTCCAAGTC TCCACCCCAT 
GACTTTCCAA AATGTCGTAA 
CGGTGGGAGG TCTATATAAG 
CATCACAGAT CTCTCACTAT 
GCTTCAGTCA TAATGTCCAG 
GCATCTCCAG GGGAGAAGGT 
CACTGGTTCC AGCAGAAGCC 
CTGGCTTCTG GAGTCCCTGT 
ACAATCAGCA GAGTGGAGGC 
AACCCACCCA CGTTCGGAGG 
TCTGTCTTCA TCTTCCCGCC 



CCTCCAAAAA AGCCTCCTCA CTACTTCTGG 
TCTGCATAAA TAAAAAAAAT TAGTCAGCCA 
CGGAGTTAGG GGCGGGATGG GCGGAGTTAG 
ATGCATGCTT TGCATACTTC TGCCTGCTGG 
TGACTAATTG AGATGCATGC TTTGCATACT 
ACACCCTAAC TGACACACAT TCCACAGAAT 
TACGGGGTCA TTAGTTCATA GCCCATATAT 
TGGCCCGCCT GGCTGACCGC CCAACGACCC 
'CCCATAGTA ACGCCAATAG GGACTTTCCA 
-CTGCCCAC TTGGCAGTAC ATCAAGTGTA 
TGACGGT AAATGGCCCG CCTGGCATTA 
TicTTGGCAG TACATCTACG TATTAGTCAT 
GTACATCAAT GGGCGTGGAT AGCGGTTTGA 
tgAtgtcaat GGGAGTTTGT TTTGGCACCA 
caaAtccgcc CCATTGACGC AAATGGGCGG 
CAGAGCTGGG tacgtgaacc gtcagatcgc 

GGATTVTTCAG GTGC AG ATT A TCAGCTTCCT 

aggacAaatt gttctctccc AGTCTCCAGC 

CACAATtACT TGCAGGGCCA GCTCAAGTGT 
AGGATCGTCC CCCAAACCCT GGATTTATGC 
TCGCTTCAGT GGCAGTGGGT CTGGGACTTC 
TGAAGATGOT GCCACTTATT ACTGCCAGCA 
GGGGACCAAG CTGGAAATCA AACGTACGGT 
ATCTGATGAGXCAGTTGAAAT CTGGAACTGC 



AATAGCTCAG 
TGCATGGGGC 
GGGCGGGACT 
GGAGCCTGGG 
TCTGCCTGCT 
TAATTCCCCT 
GGAGTTCCGC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
CTGGAGACGC 
GCTAATCAGT 
AATCCTGTCT 
AAGTTACATC 
CACATCCAAC 
TTACTCTCTC 
GTGGACTAGT 
GGCTGCACCA 
CTCTGTTGTG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
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TGCGTGCTGA 

CTCCAATCGG 

AGCCTCAGCA 

TGCGAAGTCA 
TGTTGAATTC 

GGCCGTGATA 

GTTTGCCCCT 

TAATAAAATG 

GGGGTGGGGC 

GCGGTGGGCT 

ACGTCAATGA 

TTCCTACTTG 

GGCAGTACAT 

CCATTGACGT 

GTAACAACTC 

TAAGCAGAGC 

ATGGGTTGGA 

GTACAACTGC 

TGCAAGGCTT 

GGTCGGGGCC 

CAGAAGTTCA 

CAGCTCAGCA 

TACGGCGGTG 

GCTAGCACCA 

GGCACAGCGG 

TGGAACTCAG 

GGACTCTACT 

TACATCTGCA 

AAATCTTGTG 

CCGTCAGTCT 

GAGGTCACAT 

TACGTGGACG 



ATAACTCTCTA 

GTAACTqCCA 

GCACCCT SAC 

CCCATCA 3GG 
AGATCCG TTA 

TCTACGT YTG 

CCCCCGT 

AGGAAAT 

AGGACAGdAA 

CTATGGAACC 

CGGTAAATQG 

GCAGTACAT 

CAATGGGCG 

CAATGGGAG 

CGCCCCATTG 

TGGGTACGTC 

GCCTCATCTT 

AGCAGCCTGG 

CTGGCTACAC 

TGGAATGGAT 

AAGGCAAGGC 

GCCTGACATC TGAGt 

ACTGGTACTT 

AGGGCCCATC 

CCCTGGGCTG 

GCGCCCTGAC 

CCCTCAGCAG 

ACGTGAATCA 

ACAAAACTCA 

TCCTCTTCCC 

GCGTGGTGGT 

GCGTGGAGGT 



GCCAAAGTAC 

ACAGAGCAGG 

GCAGACTACG 

CCCGTCACAA 
ACTACCTAGA 

ACTGTGCCTT 

CTGGAAGGTG 

CTGAGTAGGT 

TGGGAAGACA 

CGACAGCTAT 

ATTATGCCCA 

TCATCGCTAT 

TTGACTCACG 

• ACCAAAATCA 

'< GCGGTAGGCG 

GTGATCAGCA 

GTCGCTGTTG 

GTGAAGCCTG 

TACAATATGC 

1 TATCCCGGAA 

GCAGACAAAT 

GCGGTCTATT 

GGCGCAGGGA 

CTGGCACCCT 

GACTACTTCC 

CACACCTTCC 

GTGCCCTCCA 

AACACCAAGG 

CCGTGCCCAG 

AAGGACACCC 

CACGAAGACC 

AAGACAAAGC 

-70- 



AGTGGAAGGT 

ACAGCAAGGA 

AGAAACACAA 

AGAGCTTCAA 
CTGGATTCGT 

CTAGTTGCCA 

CCACTCCCAC 

GTCATTCTAT 

ATAGCAGGCA 

GCCAAGTACG 

GTACATGACC 

TACCATGGTG 

GGGATTTCCA 

ACGGGACTTT 

TGTACGGTGG 

CTGAACACAG 

CTACGCGTGT 

GGGCCTCAGT 

ACTGGGTAAA 

ATGGTGATAC 

CCTCCAGCAC" 

ACTGTGCAAG 

CCACGGTCAC 

CCTCCAAGAG 

CCGAACCGGT 

CGGCTGTCCT 

GCAGCTTGGG 

TGGACAAGAA 

CACCTGAACT 

TCATGATCTC 

CTGAGGTCAA 

CGCGGGAGGA 



GGATAACGCC 

CAGCACCTAC 

AGTCTACGCC 

CAGGGGAGAG 
GACAACATGC 

GCCATCTGTT 

TGTCCTTTCC 

TCTGGGGGGT 

TGCTGGGGAT 

CCCCCTATTG 

TTATGGGACT 

ATGCGGTTTT 

AGTCTCCACC 

CCAAAATGTC 

GAGGTCTATA 

ACCCGTCGAC 

CCTGTCCCAG 

GAAGATGTCC 

ACAGACACCT 

TTCCTACAAT 

AGCCTACATG 

ATCGACTTAC 

CGTCTCTGCA 

CACCTCTGGG 

GACGGTGTCG 

ACAGTCCTCA 

CACCCAGACC 

AGCAGAGCCC 

CCTGGGGGGA 

CCGGACCCCT 

GTTCAACTGG 

GCAGTACAAC 



1500 

1560 

1620 

1680 
1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 
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AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 
GAGTACAAGT GCAAGGTcW CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 
AAAGCCAAAG GGCAGCCCdG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG 
CTGACCAAOA ACCACJQTCA 3 CCTOACCTQC CTOOTCAAAO OCTTCTAT C C CAQCOACA^ 
GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG 
CTGGACTCCG ACGGCTCCT"! ' CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 
CAGCAGGGGA ACGTCTTCTa ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG 
CAGAAGAGCC TCTCCCTGTcl TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT 
AGACTGGATT CGTGACAACaI TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC 
CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG 
ACAATAGCAG GCATGCTGGG G^TGCGGTGG GCTCTATGGA ACCAGCTGGG GCTCGACAGC 
GCTGGATCTC CCGATCCCCA GC^TTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 
AGGAAAATTA ATTTTAACAC CAaVtCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 
ATGCTTTAGA GACAGTGTTC TCTgWaGA TAAGGACAAA CATTATTCAG AGGGAGTACC 
CAGAGCTGAG ACTCCTAAGC CAGTQAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 
ATCACCGAAG CCTGATTCCG TAGAG^CACA CCTTGGTAAG GGCCAATCTG CTCACACAGG 
ATAGAGAGGG CAGGAGCCAG GGCAGAfeCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 
CATTTGCTTC TGACATAGTT GTGTTGOGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG 
ATTTCGCGCC AAACTTGACG GCAATCC+AG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 
GCCATCATGG TTCGACCATT GAACTGcArC GTCGCCGTGT CCCAAAATAT GGGGATTGGC 
AAGAACGGAG ACCTACCCTG GCCTCCGCic AGGAACGAGT TCAAGTACTT CCAAAGAATG 
ACCACAACCT CTTCAGTGGA AGGTAAACA^ AATCTGGTGA TTATGGGTAG GAAAACCTGG 
TTCTCCATTC CTGAGAAGAA TCGACCTTTA\ AAGGACAGAA TTAATATAGT TCTCAGTAGA 
GAACTCAAAG AACCACCACG AGGAGCTCAT yTTCTTGCCA AAAGTTTGGA TGATGCCTTA 
AGACTTATTG AACAACCGGA ATTGGCAAGT aVlAGTAGACA TGGTTTGGAT AGTCGGAGGC 
AGTTCTGTTT ACCAGGAAGC CATGAATCAA CcVgGCCACC TTAGACTCTT TGTGACAAGG 
ATCATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA 
CTTCTCCCAG AATACCCAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT 
AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTTCAAGTT CTCTGCTCCC 



3420 
3480 
3540 

3000 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 
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CTCCTAAAGC TA*GCATTTT TATAAGACCA TGGGACTTTT GCTGGCTTTA GATCAGCCTC 
GACTGTGCCT TCtLttGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 
CCTGGAAGGT GCcicTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG 
TCTGAGTAGG TGTOATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 
TTGC3GAAGAC AATAeCAGGC ATGCTGGGGA TGCGG'I'UGGC TCTATGGAAC CAGCTGGGGC 
TCGAGCTACT AGCtWtt CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 
AATTTTAACA CCAAljrCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 
AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 
GACTCCTAAG CCAGTgUgTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 
GCCTGATTCC GTAGAGaCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 
GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 
CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTCTATG GTTGAACAAG ATGGATTGCA 
CGCAGGTTCT CCGGCCGC*T GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC 
AATCGGCTGC TCTGATGCCa CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC CGGTTCTTTT 
TGTCAAGACC GACCTGTCCg\ GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC 
GTGGCTGGCC ACGACGGGCG WcCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 
AAGGGACTGG CTGCTATTGG dtCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 
TCCTGCCGAG AAAGTATCCA tAaTGGCTGA TGCAATGCGG CGGCTGCATA CGCTTGATCC 
GGCTACCTGC CCATTCGACC AcVaAGCGAA ACATCGCATC GAGCGAGCAC GTACTCGGAT 
GGAAGCCGGT CTTGTCGATC AGgVtGATCT GGACGAAGAG CATCAGGGGC TCGCGCCAGC 
CGAACTGTTC GCCAGGCTCA AGGCbcGCAT GCCCGACGGC GAGGATCTCG TCGTGACCCA 
TGGCGATGCC TGCTTGCCGA ATATClRTGGT GGAAAATGGC CGCTTTTCTG GATTCATCGA 
CTGTGGCCGG CTGGGTGTGG CGGAcAjCTA TCAGGACATA GCGTTGGCTA CCCGTGATAT 
TGCTGAAGAG CTTGGCGGCG AATGGGOTGA CCGCTTCCTC GTGCTTTACG GTATCGCCGC 
TCCCGATTCG CAGCGCATCG CCTTCTaIcG CCTTCTTGAC GAGTTCTTCT GAGCGGGACT 
CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC 
ACCGCCGCCT TCTATGAAAG GTTGGGCT'nC GGAATCGTTT TCCGGGACGC CGGCTGGATG 
ATCCTCCAGC GCGGGGATCT CATGCTGGA4 TTCTTCGCCC ACCCCAACTT GTTTATTGCA 
GCTTATAATG GTTACAAATA AAGCAATAGc\ ATCACAAATT TCACAAATAA AGCATTTTTT 
TCACTGCATT CTAGTTGTGG TTTGTCCAAA tTCATCAATC TATCTTATCA TGTCTGGATC 
GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT A^TCATGGTC ATAGCTGTTT CCTGTGTGAA 
ATTGTTATCC GCTCACAATT CCACACAACA TA6pAGCCGG AAGCATAAAG TGTAAAGCCT 

-72- 



S280 

5340 

5400 

5460 
5520 

5580 

-5640 

5700 

5760 

5820 

5880 

5940 

6000 

6060 

6120 
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6900 

6960 

7020 

7080 

7140 
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GGGGTGCCTA OTGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC 7200 
AGTCGGGAAA CcVgTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG 72 60 

GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC 7320 

QGCTQCGOCa AGCdGTATQA GJCTCACTCAA AQaCOOTAAT ACQOTTATCC ACAOAATCAO 7 3 go 

10 GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA 7440 

AGGCCGCGTT GCTG<±:GTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC 7500 

GACGCTCAAG TCAGAfcGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC 75 60 

CTGGAAGCTC CCTCGtIgCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 7620 

CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG TATCTCAGTT 7 680 

CGGTGTAGGT CGTTCGdTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC 7740 

GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 7800 

CACTGGCAGC AGCCACTCfcT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 78 60 

AGTTCTTGAA GTGGTGGC&T AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 7 92 0 

CTCTGCTGAA GCCAGTTACt TTCGGAAAAA GAGTTGG TAG CTCTTGATCC GGCAAACAAA 7980 

30 CCACCGCTGG TAGCGGTGCA TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG 8040 

GATCTCAAGA AGATCCTTTGt ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT 8100 

CACGTTAAGG GATTTTGGTC UtGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA 8160 

ATTAAAAATG AAGTTTTAAA TCAATCTAAA G T AT AT ATG A GTAAACTTGG TCTGACAGTT 8220 

ACCAATGCTT AATCAGTGAG C^pACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG 82 80 

TTGC CTG ACT CCCCGTCGTG tAgaTAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 8 340 

GTGC TGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 8400 

AGCCAGCCGG AAGGGCCGAG CGQAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 84 60 

CTATTAATTG TTGCCGGGAA GCtAgaGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 8520 

TTGTTGCCAT TGCTACAGGC ATCGTfGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA 8580 

GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 8640 

TTAGCTCCTT CGGTCCTCCG ATCGTTffiTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA 87 00 

TGGTTATGGC AGCACTGCAT AATTCTcWa CTGTCATGCC ATCCGTAAGA TGCTTTTCTG 87 60 

TG AC TGGTG A GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT 8820 

CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA 8880 

60 TCATTGGAAA ACGTTCTTCG GGGCGAAAAcV TCTCAAGGAT CTTACCGCTG TTGAGATCCA 894 0 

GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 9000 
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r 
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TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 
GGAAATGTTg\ AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 
ATTGTCTCAT taAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC 
CGCGCACATT TCCCCGAAAA GTGCCACCT 



9060 
9120 
9180 
9209 



10 (4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

CA)\ LENGTH: 54 bases 

(B) \ TYPE: nucleic acid 

(C) \ STRANDEDNESS: single 

(D) \ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: y& 

A. 

(iv) ANTI-&ENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



5 " ATC ACA GAT\ CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT 
TC 3 ■ 



A 



(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULe\tYPE: DNA (genomic) 

(iii) HYPOTHETICAL: S&*~ 

(iv) ANTI-SENSE : \yes 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



50 



55 



5 ' TGC AGC ATC CGT ACG TOT GAT TTC CAG CTT 3 1 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 



30 
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LENGTH: 384 bases 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 



MOLECULE TYPE: DNA (genomic) 



HYPOTHETICAL: «fi£^ 
A 

T-SENSE: no 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
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ATG 


GAT 


TTT 


CAG 


i GTG 


CAG 


ATT 


ATC 


AGC 


TTC 


CTG 


CTA 


ATC 


AGT 


GCT 


TCA 


GTC 


51 


ATA 


ATG 


TCC 


AGA 


IGGG 


CAA 


ATT 


GTT 


CTC 


TCC 


CAG 


TCT 


CCA 


GCA 


ATC 


CTG 


TCT 


102 


GCA 


TCT 


CCA 


GGG 


GAG 


AAG 


GTC 


ACA 


ATG 


ACT 


TGC 


AGG 


GCC 


AGC 


TCA 


AGT 


GTA 


153 


AGT 


TAC 


ATC 


CAC 


TCG 


TTC 


CAG 


CAG 


AAG 


CCA 


GGA 


TCC 


TCC 


CCC 


AAA 


CCC 


TGG 


204 


ATT 


TAT 


GCC 


ACA 


TCC 


AAC 


CTG 


GCT 


TCT 


GGA 


GTC 


CCT 


GTT 


CGC 


TTC 


AGT 


GGC 


255 


AGT 


GGG 


TCT 


GGG 


ACIT 


TCT 


TAC 


TCT 


CTC 


ACA 


ATC 


AGC 


AGA 


GTG 


GAG 


GCT 


GAA 


306 


GAT 


GCT 


GCC 


ACT 


TAir 


TAC 


TGC 


CAG 


CAG 


TGG 


ACT 


AGT 


AAC 


CCA 


CCC 


ACG 


TTC 


357 


GGA 


GGG 


GGG 


ACC 


AA<£ 


CTG 


GAA 


ATC 


AAA 


















384 



(7) INFORMATION FOR SEQ ID NO: 6: 



(i) 



SEQUENCE CHARACTERISTICS: 

TG 
E: 



(A) 
(B) 
(C) 
(D) 



LENGTH: 27 bases 
nucleic acid 
STRANDEDNESS: single 
TOHOLOGY: linear 



(ii) 
(iii) 
(iv) 
(ix) 



MOLECULE TYPE: DNA (genomic) 
HYPOTHETICAL: 
ANTI-SENSE: no 

SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



50 



5 ' GCG GCT CCC ACGl CGT GTC CTG TCC CAG 3 1 



27 
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(8) INFORMATION FOR SEQ ID NO: 7: 
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(i) 



SEQUENCE CHARACTERISTICS: 



(A) 
(B) 
(C) 



ENGTH: 29 bases 

fYPE: nucleic acid 

$TRANDEDNESS: single 
"OPOLOOY: linear 



(ii) 
(iii) 
(iv) 
(ix) 



MOLECULE TYPE: DNA (genomic) 
HYPO r 

ANTI-SfeNSE: yes 
SEQUE tfCE DESCRIPTION: SEQ ID NO: 7: 



HETICAL: 



(i) 



5* GGS TGT TGT GCT AGC TGM RGA GAC RGT GA 3' 



(9) INFORMATION FOR SEQ ID NO: 8: 



29 



SEQUENCE CHARACTERISTICS: 



(A) 
(B) 
(C) 
(D) 



NGTH: 420 bases 
TYPE: nucleic acid 



s r ?: 



RANDEDNESS: single 



(ii) 
(Hi) 
(iv) 
(ix) 



ATG GGT 

CTG TCC 

GCC TCA 

AAT ATG 

GCT ATT 

AAG GCC 

AGC AGC 

TAC TAC 
ACC GTC 



TGG AGC 

CAG GTA 

GTG AAG 

CAC TGG 

TAT CCC 

ACA TTG 

CTG ACA 

GGC GGT 
TCT GCA 



TOPOLOGY: linear 
MOLECjJLE TYPE: DNA (genomic) 

no 

HYPOTHETICAL: yesr*— 
ANTI-SEjNSE: no 

SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



ATC 
CTG 

rcc 



CTC 
CAA 
ATG 
GTA 
GGA AlVT 
ACT GuA 
TCT G.PG 
GAC TCG 



TTG CTC 
CAG CAG 
TGC AAG 
CAG ACA 
GGT GAT 
GAC AAA 
GAC TCT 
TAC TTC 



TTC CTT 

CCt GGG 
GCT TCT 
CCT GGT 
ACT TCC 
TCC TCC 
GCG GTC 
AAT GTC 



GTC 


GCT 


GTT 


GCT 


ACG 


CGT 


GTC 


51 


GCT 


GAG 


CTG 


GTG 


AAG 


CCT 


GGG 


102 


GGC 


TAC 


ACA 


TTT 


ACC 


AGT 


TAC 


153 


CGG 


GGC 


CTG 


GAA 


TGG 


ATT 


GGA 


204 


TAC 


AAT 


CAG 


AAG 


TTC 


AAA 


GGC 


•255 


AGC 


ACA 


GCC 


TAC 


ATG 


CAG 


CTC 


305 


TAT 


TAC 


TGT 


GCA 


AGA 


TCG 


ACT 


357 


TGG 


GGC 


GCA 


GGG 


ACC 


ACG 


GTC 


403 
420 
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